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ABSTRACT 


The Selective Reenlistment Bonus (SRB) program is designed to offer an 
attractive reenlistinent incentive to improve manning in critical skills. To efficiently 
manage the SRB program, a requirement exists to maintain MOS level estimating 
factors for use in projecting retention rate improvement as a function of SRB award 
level. This thesis formulates and solves a mathematical model which explains the 
variation in zone A retention rates as a function of SRB award level and other factors 
believed significant in the reenlistment decision. 

To allow for comparison of the estimating factors associated with the SRB 
variable across MOS, an overall projection model was developed. Stepwise multiple 
linear regression analvsis techniques were used on a subset of the enlisted MOS 
inventory in the model development phase of this analysis. The proposed overall 
model was then fitted to a second subset of MOS to validate the assumptions and 


effectiveness of the proposed linear model. 
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I. INTRODUCTION 


The Commander, United States Army. Military Personnel. Center 
(MILPERCEN), is responsible for developing and issuing policies, standards and 
procedures in the administration of the Selective Reenlistment Bonus (SRB) program. 
The SRB program is designed to olfer an attractive reenlistment incentive to improve 
manning in the most critical skills. A primary consideration in the management of the 
SRB program is the historic. effectiveness of an. SRB in improving retention in a 
particular skill. In this studv, the problem of measuring the historic effectiveness of the 
SRB program is modelled and solved using stepwise and ordinary least squares multiple 


linear regression analysis. 


A. PROBLEM STATEMENT 

The Commander, MILPERCIEN must recommend to the Deputy Chief of Staff 
for Personnel (DCSPER) those Military Occupational Specialties (MOS) which should 
be included in the SRB program. The criteria used to determine which MOS should be 
included in the SRB program are outlined in the form of several guidelines (specifically, 
Title 37 United States Code, section 308, Department of Defense (DOD) Directive 
1304.21 and DOD Directive 1304.22). Some criteria, such as replacement training 
costs, are easily quantified. Other criteria, such as the relative unattractiveness of each 
MOS compared to other military and civilian skills, are much more subiective. 

One criterion upon which the decision to include a particular MOS m the SRB 
program is based is the projected improvement in retention in response to the bonus 
awarded. There must be a reasonable prospect of enough improvement in retention to 
justify the projected cost of the bonus. Therefore, a requirement exists to maintain 
estimating factors for use in projecting retention rate improvement as a function of 
SRB award level. DOD directs that these factors be developed from actual experience 
under the SRB program. 

The improvement factors currently available are outdated and were developed 
without consideration to certain variables believed critical to an accurate projection of 


retention at the MOS level. 


В. BACKGROUND 

In September 1981, the DCSPER requested that the Commander, United States 
Army Concepts Analysis Agency (CAA) establish a study group to develop an 
improved methodology [or allocation of SRB funds. An intermediate goal of the study 
group was to quantify the effect of SRB on retention; that is, develop a set of 
historically based improvement factors. These factors were to replace similar 
improvement factors published by the Rand Corporation in September 1977 | КеГ |. 
The DCSPER suggested that the Rand factors were no longer valid, in light of more 
recent trends in retention, pay and civilian perception of military service. 

In August 1982, the study was completed by CAA. Included in their final report 
[Ref. 2] were a set of MOS and reenlistment zone specific SRB cffectiveness factors. 
These factors were said to represent the net change in retention rate for a given MOS 
brought on by a change in the SRB authorized that MOS. The factors were actually 
the estimated regression coeflictents of the carricr variable SRB in the multiple linear 
regression model used to explain retention rate behavior for all MOS during the 


previous five years. The specific model follows: 
Ү= В + ИХ] + Ву + Ху + Ха” (1.1) 


where: 
Y - retention rate 
Хү = SRB multiplier 
Ny = year 
3 = calender quarter 


41 = unemployment rate 
Z^ = Consumer Price Index 


€ = crror component with assumed distribution N( 0, c? | 


While the study group cautioned against using the retention improvement factors 
(estimated regression coefficient 01) for longer than two years, no provisions were 
made for the periodic re-estimation of those coefficients. Ilence, the current set of 
coefficients are a function of data which are at least five years old. Additionally, while 


diagnostics from the CAA model support a reasonably good fit to the data available. 
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no attempt was made by the CAA analysts to account for the effects of factors such as 
population demographics and promotion opportunity. 

The Deputy Chief of Staff for Plans (DCSPLANS), MILPERCEN submitted this 
problem, with the below stated objectives, to the Naval Postgraduate School, pursuant 
to a special thesis study / management program. Under this program, a participating 
Army student works with MILPERCEN to resolve a current problem and receives a 
follow-on assignment to the Personnel Center upon graduation. All research costs and 


other costs associated with thesis preparation are borne by MILPERCEN. 


Ce TED OBJECTIVE 

The objective of this study is to formulate a mathematical model which explains 
the variation in zone A enlisted retention rates over time at the MOS level of detail. 
Variables representing promotion opportunity to grades E5 and E6 and a variable 


representing SRB award level are to be considered as candidate explanatory variables. 


Dero DEL AND SOLUTION APPROACH 

The mathematical formulation proposed in this study 15 an ordinary least squares 
multiple linear regression model with higher order terms. It is our intention to 
carefully select our dependent and independent variables so that the model can be used 
in a predictive manner: given a set of outcomes on the explanatory variables, we wish 
to predict an outcome on our selected response variable with a measureable degree of 
precision. 

Our objective is to build a model which can predict zone A retention at the MOS 
level. It is likely therefore, that if each MOS subpopulation were studied 
independently, the carrier variables included in the final model (selected by some 
system of rules) would not be identical for each MOS. This situation, for our 
purposes, 1s not acceptable. 

The intentions of our user dictate that we select a best model and apply it for all 
MOS. As has already been mentioned, the SRB managers have used the estimated 
coefficient of the carrier variable SRB (we refer to this estimate henceforth simply as 
bi) to compare the effects on retention of varying the SRB level across several, or even 
all, MOS. Mosteller and Tukey [Ref. 3: pp. 315-331] warn that the coefficient of a 
carrier 1$ very dependent on it's costock. In our case, we will attempt to construct a 
model so that the carrier variable representing SRB is unrelated to any variable in the 


costock. The interpretation of the estimated coefficient as the effect of SRB level 
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changing while costock variables keep their same values 1s then reasonable at the MOS 
level. For comparisons to be made across different MOS however, we must use the 
same model for all MOS. While such a solution approach has the disadvantage of 
suboptimizing our prediction capability at the MOS level, it has the large advantage of 
permitting a rcasonably valid comparison of the relative effectiveness of SRD across a 
group of MOS. 

From the perspective of the user, the overall model approach offers two other 
distinct advantages. First, it offers simplicity. The managers who will be responsible 
to implement and maintain this model are not operations analysts and will resist 
integrating a complicated model / procedure into an already busy schedule. Second, an 
overall model offers credibility. It would be very difficult to explain to non-analysts 
why a particular carrier, sav Consumer Price Index, is pertinent to the reenlistment 
decision of a soldier in one MOS, but not in another. 

An outline of the steps included in our modelling and solution approach follows. 
It is consistent with a methodology recommended by Draper and Smith [Ref. 4: p. 
414]. 


1 ШІЛДЕ! the problem. Select a response variable. Suggest relevant carrier 
variables. 


2 Can we obtain a complete set of observations on all specified carrier. variables 
and the selected response variable? If not, return to step (1). Otherwise, 
continue. 


3 Establish model goals. Consider the minimum / maximum number of included 
carrier variables desired and determine the desired level of statistical significance 
for the estimated coeiiicicnts of eae. 


4 Бш a correlation matrix. Guard against including carriers which are highly 
correiated. 


5 Conduct indep ndeni ший linear stepwise regression analysis for each MOS 
included. in the study. [Examine the residuals for support of the model 
assumptions. Are the models adequate? If not, return to step (1). Otherwise, 
continue. 


6 Propose an overall linear regression model. 
Conduct ordinary least-squares multiple linear regression analysis for each MOS 
included, in the study. Examine the residuals for support of the model 
assumptions. Is the model adequate? If not, return to step (6). Otherwise, 
continue. 


5 Are the coefficients reasonable? Is the model plausible? Is the equation usable? 
If not, return to step (1) or (6) as appropriate. 
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ЕТРР СРРЖТЭ ХЭ т Г1ОХ5 

Some further assumptions should be addressed. We assume that an individual's 
propensity 10 reenlist is a function of many variables, both personal and 
environmental. We assume that it is possible to formulate a mathematical model 
which estimates the propensity of individuals to reenlist at the MOS level. While this 
assuinption is driven by a user requirement for an MOS level model, it is not an 
unreasonable one. The assumption implies that individuals in the same MOS behave 
similarly with respect to the factors which affect their reenlistment decision. It also 
allows that soldiers in different MOS may have different perceptions of the 
environment in which they make their reenlistment decision. These implications can be 
justified with respect to the Enlisted Personnel Management System (EPMS). The 
duties and training required of each MOS are associated with different civilian skills. 
Also, the general qualifications and skills of the MOS subpopulations are sorted at 
enlistment. For example. the mean Armed Forces Qualification Test (AF QT) score for 
one MOS is not the same. nor is it intended to be the same, as anv other MOS. EPMS 
establishes the MOS as the basic unit of personnel inventory management. It is not 
only the required level, but also the logical Ievel at which to conduct this study. 

We must also assume for the purposes of this study that EPMS remains 
relatively stable. Further, we assume that the socio-economic environment in which 
the soldier makes a reenlistment decision is stable (within the norms established in the 


historic scope of this study). 


ОБ ОВИЕ 

This thesis formulates and develops a mathematical model which explains the 
Valiation im zone A retention at the MOS level. In Chapter II, a brief overview of the 
SRB program is presented. In Chapter II], the assumptions and analysis leading to the 
development of an overall model are explained. In Chapter IV, the results of fitting 
the proposed overall model to the available data are presented and discussed. Finally, 


Chapter V includes the conclusions and recommendations of this study. 


СЛИВО ОКО Ом КСО LANGUAGES AND STATISTICAL PACKAGES. 

АП programming associated with data collection and manipulation was 
completed using FORTRAN 77 code. All data analvsis and most graphics were 
completed using the SAS, version V, statistical package. These choices were made with 


respect to the current capabilities and assets of the Military Personnel Center. 
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П. THE SELECTIVE REENLISTMENT BONUS PROGRAM 


This Chapter presents a brief overview of the Selective. Reenlistment. Bonus 
(SRB) program. Criteria for including MOS in the program are outlined, as are the 
cligibility requirements and payment procedures. Finally, the budget history of the 


program is graphically summarized. 


A- THE OLJECIMINE 
The Selective Reenlistment Bonus program is designed to offer an attractive 


reenlistment Incentive to improve manning in critical military specialties. 


В. CRITERIA FOR INCLUDING MOS IN TIE SRB PROGRAM 
As has been previously noted, there are many criteria. considered before 
including, or excluding an MOS from the SRB program. Among these factors arc: 
1 а comparison of carcer manning requirements with projected inventory, 
the cost of formal school training for replacement personnel, 
the expected increase in retention as a result of inclusion m the SRB program, 


the priority of MOS in terms of it’s essentiality to the Army mission, 


Сл Ја W N 


the inherent unattractiveness of the MOS with respect to other military and 
civilian occupations. 


С. ZONES OR СЕРЕН 
There are three zones of individual SRB cligibility. They are: 
| zone A, which applies to those service members who have completed, at least 21 
months of continuous active duty but not more than 6 years of active duty on 
the day of reenlistment. 


2 zone D, which applies to those service members who have completed at сая: 6 
ut no more than 10 years of active duty on the day of rcenlistinent. 


J zone C, which applies to those service members who have completed at Icast 10 
but no more than 14 years of active duty on the day of reenlistment. 


D. THE AMOUNT OF BONUS AND METITOD Or PAYS? 
]. Amount of Bonus 
The reenlistment bonus to which a service member is entitled upon 


reenlistment is computed as follows: 
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SRB = (monthiy base pay) x (years of additional obligated service) (2.1) 


x (SRB level) 


Where the SRB multiplier can assume values of 0, 1, 2, 3, 4. ог 5. No more than one 
SRB is authorized per soldier per zone. No SRB can exceed $20,000.00. 

2. Method of Payment 

Upon qualification for award of an SRB, a service member receives 50% of 

the authorized SRB on the day of reenlistment, and the balance in equal annual 
installments on the anniversary of the reenlistment during the reenlistment contract 
period. | 
ЕЕЕ ЕЕЕ ЕСТІСЕ CRITERIA FOR ENLISTED SERVICE 

MEMBERS. 

The individual eligibility criteria for service members 1s as prescribed in. Army 
Regulation (AR) 600-200 and AR 601-250. 


Poe У МЕХ Teese RIENCE 

As 1s indicated above, the amount of the SRB award to which an individual 15 
entitled is a function of three factors: SRB award level, individual monthly base pay, 
and vears of additional obligated service incurred as a result of the contract. The two 
following graphics are included to provide the reader with a feel for the scope of the 
problem. At Figure 2.1, the horizontal axis lists fiscal years while the vertical axis ts 
s sn c sur Ee Otal number ol Zonc A SRB takers for cach year. At l'igure 2.2, 
the horizontal axis again represents fiscal years, but the vertical axis represents the 
total zone A SRB expenditures for each year. We note that both bonus takers and 
expenditures were at a low point in FY83. We note also that while the total number of 
zone A bonus takers has increased over the last 2 years, the total expenditures have 
not. The underlying cause of this trend is that, in general, reenlistment bonuses are 


available to more eligible soldiers, but at a lower level. 


BONUS TAKERS 
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ШІ. MODEL FORMULATION 


In this Chapter, the assumptions and analysis leading to the development of an 
overall model are explained. First, the basic multiple linear regression model is 
proposed in matrix notation. Then a response variable and a set of candidate carrier 
variables are suggested. A sampling period is defined for use in estimating parameters 
associated with the proposed variables. The problems encountered in data collection 
and data preparation are discussed. The results of independent stepwise regression 
analysis on each of the included MOS are explained. Finally, an overall multiple linear 


regression model is proposed. 


А. PROPOSED LINEAR MODEL IN MATRIX ГОКМ 

In this thesis, we assume that there exists a relationship between the propensity 
of a soldier to reenlist and that soldier’s perception of the environment. A reliable 
method of analysis to examine the nature of the relationship between our proposed 
response variable (some measure of retention rate) and our candidate carrier variables 
(which will attempt to account for changes in the makeup or environment of the 
reenlistment (decision-maker) 15 the method of least squares, or regression analysis. 
Using this method of analysis, we will attempt to fit the following multiple linear 


regression model to the data we collect for cach MOS: 


У = XÜ + £ Ва 


where: 
Y 1s an (n x 1) vector of observations on the selected response variable 
X ts an (n x p) matrix of observations on the selected carrier variables 
D is a (p x I) vector of parameters to be estimated 


M 2 
2 15 ап (n x 1) vector of errors assuined to have the distribution N( 0, o“I ) 


It is shown [Ref. 4: pp. 86-87] that if X’X is non-singular, the least squares 


estimate of f, call it b, can be written as: 
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ь = (ХХХ ee) 


with varlance-covariance matrix (ХХ) 152, Thus, the variance associated with 


estimating any particular coefficient ts given by: 
V(b;) = с.62 Ga) 
1 11 і 


where c;; is the diagonal element in (x'xy! corresponding to ith variable. Further, a 


prediction of Y at Xo Is given by: 

AN я | 

ШЕ- Ух (3.4) 
with variance given by: 


A 
ND) S XOU 2) (3.5) 


B. SELECTION OF THE RESPONSE VARIABLE 

We have assumed that MOS subpopulations can be treated as discrete groups 
with respect to their propensity to reenlist. Therefore, it follows that if the variables 
relevant to the reenlistment decision were known, and their levels could be fixed, or 
considered fixed for a period of time, the reenlistment propensity of these discrete 
groups could also be considered fixed. Let us assume that these propensities are 
probabilities. Then, since a soldier either does (1) or does not (0) reenlist, over a 
period of time we will observe outcomes on repeated bernoulli trials with fixed 
parameter p. 

If we further assume these observations are independent, then we can use the 
maximum likelihood estimator for parameter p (D = number of reenlistments observed 
/ number of trials). Hence, one method for obtaining an estimate of the reenlistiment 
propensity for a given MOS is to observe outcomes on the reenlistment decision for a 
period of time short enough so that relevant conditrons may be fixed or considered 
fixed, yet long enough to obtain a sample size which will enable us to discern small 


changes in the population parameter. 
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The purpose of the SRB program, as stated in Chapter II, is to improve manning 
in critical military specialties. An SRB can be considered effective in 2 ways. First, an 
SRB can induce a soldier to rcenlist for his own MOS, who may otherwise have left the 
service. Second, it can induce a soldier to reenlist for his own MOS, who may 
otherwise have reenlisted for training in another specialty. In conjunction with 
program managers at MILPERCEN, the following retention (vice reclassification) rate 


has been developed for use as the response variable in this study: 
Y = retention rate = propensity of a soldier to reenlist for his own MOS. 
It is estimated by: 


ж . . . . . . 
Y = estimated retention rate — number of soldiers rcenlistng for their own MOS / 


number of soldiers eligible to do so. 


Obviously excluded from our estimator $ (not included in either numerator or 
denominator expressions) are service members who are not fully eligible for 
reenlistment at the decision point. An SRB cannot induce an otherwise ineligible 
soldier to reenlist. Also excluded are reenlistments which occur outside the window of 
eligibility (6 months for first term soldiers, 3 months otherwise) and all extensions. 
These actions, while not independent of the effects of the SRB program. occur for 
exceptional reasons unrelated to tlie SRB award level. Soldiers who reenlist, but 
reclassify in conjunction with reenlistment, are not counted in the numerator of our 
estimator, but are included in the denominator. 

Retention data is available at the individual soldier level on mass storage at 
MILPERCEN. However, owing to significant changes in the manner in which these 
data were recorded prior to fiscal year 1981, earlicr data are not. readily available. A 
magnetic tape, containing information pertinent to the reenlistment or separation of 
soldiers during the period 1 Oct 81 through 30 Sep 85, was provided by MILPERCEN 
to support this study. Excluded from this tape were transactions concerning service 
members outside of the three SRB zones, or who otherwise fell into an excluded 
category as described in the previous paragraph. [n all, more than 481,000 individual 


records were included in the file. 
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Р ОУСОРТНЕЁ СА КВІРВ VARIABLES 
l. SRB Level | 
SRB level is the earrier variable of interest in this study. It exists at one of 6 
diserete levels for all MOS, for all zones, at all times. These levels are 0, 1, 2, 3, 4, 
апа 5. Reeord of the SRB history for eaeh MOS is not eurrently avaiiable in machine 
readable form, but hardeopy reeords were made available by the MILPERCEN 
program managers dating baek to 1974. 
2. Endogenous Variables 
The endogenous variables, for the purposes of this study, are those variables 
which provide information on the demographie eomposition of the discrete groups 
themselves. For each record eontained on the data tape provided by MILPERCEN, 
the following demographic data are reeorded: 
AFQT score, 


eivilian edueation level, 


МО = 


SCX, 


qu 63 


number of dependents, 
Prace. 
It 1s our intention in recording these data, to eonstruet variables which may be 
included in the overall regression model to eontrol for the effects of population 
dynamies. 
3. Exogenous Variables 
Unemployment rate is ineluded as a statistic. which is visible to the 
reenlistment deeision-maker and may represent one quantitative. measure of the 
soldier's eareer alternatives. This data is readily available in the Employment and 
Earnings Monthly, published by the Bureau of Labor Statistics (BLS). The data is 
summarized by oeeupational classification and region. Since most Army skills do not 
readily fall into any of the BLS elassifications, our statistic of ehoiec 1s the seasonalized 
aggregate unemployment rate. 
Consumer Price Index (CPI), as a measure of the ehange in the spending 
power of the soldier, is also considered a vital statistie. Data 15 again available on a 
monthly basis in the BLS published CPI Detailed Report. The statistic most relevant 
for our uses is the seasonalized statistic for all urban consumers. 
Pay seale changes are believed to be at least as important as CPI. Considered 


with CPI, a measure of the real ehange in a soldier’s purehasing power ean be derived. 
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Promotion opportunity to pay grades E5 and E6 is considered very important. 
Variables which account for the change in promotion opportunity at the MOS level 
were of specific concern to the MILPERCEN program managers. Our problem here 
however, is to identify a measure visible to the reenlistment decision-maker and for 
which a reliable historic record exists. The monthly published promotion cut-off scores 
were an immiediate choice as an explicit and simple indicator of relative promotion 
opportunity, but MILPERCEN promotion program managers have maintained no 
data older than 2 years. As an alternative, it was decided to include a statistic reported 
on the monthly DCSPER 411, Enlisted Strength Report, available on microfiche only. 
The statistic, mean time in service at promotion for those promoted, in the previous 12 
months, reports a 12 month promotion moving point average for both grades at the 
MOS level. This statistic is included, as it 15 believed. that a soldier making a 
reenlistment decision is sensitive to the effects changes in promotion policy have on the 


careers of those around lim. 


D. "SELECTION OF оо ЕРЕКЕ 

As has been mentioned, our data collection capability is linuted to the five fiscal 
years from FYS1 through FY$85. A change in the manner in which loss data was 
recorded precludes our obtaining reliable data on earlier records. 

Inasmuch as we plan to observe outcomes on the reenlistment decision over a 
period of time during which the levels of the independent variables included in our 
regression model are considered fixed, we must decide upon a sample period. An 
immediately attractive alternative is the fiscal quarter for several reasons. First, the 
SRB program is managed in accordance with a quarterlv cycle. Second, several of our 
data (such as the promotion statistics) are reported at quarterly intervals. Third, 
several of our data (such as CPI) are much more stable at the quarter level. 

Analysis was conducted to determine the appropriate sainple size of cligibles 
required to ensure that a reliable base of MOS and zone specilic retention rate 
estimates was obtained. Specifically, we wish our sample size to bc large enough so 
that 90% of the time our estimate Y is within 10% of the true parameter Y= Then 
using an approximate 90% confidence interval for for the Bernoulli parameter Y 
[Ref. 5: pp. 394-395], we can compute the minimum number of observations, n, 
required to satisfy our requirement. The approximate 90% confidence interval can be 


written as: 
^ 
Р(Ү - 1.645(% (1-Фуп) 2 < у < % + 1.645 (1-Фуп) 2 ) = .90 
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| | 222 
The variance of the estimate 15 maximized with Y = 0.5, 
Р(.5 - 1.645(.25/п)/2 < Ү < .5 + 1.645(.25/п)1/2) = .90 


: : ^. 225 
We see that to be 90% confident that our estimate Y is within 10% of the true 


parameter Y, it must be truc that: 
1.645(.25/) ? < 10 

Solving the above equation for n, we find that: 
їйє 68 


We next require each MOS included in our analysis to have at Їса5! 08 zonc A 
rcenlistment outcomes per quarter for no fewer than 14 of the 20 quarters of data 
available. We will refer to such MOS as high density. In addition we require that the 
MOS be authorized as of the end of FY85 and that it have an active SRB history in 
our period of study. That is, there must be at least one change in SRB level during the 
data period. When these requirements are imposed, the number of MOS included in 
our analvsis 1s reduced from an initial 374 to 24. These MOS arc listed in Table 1. 

Consider the SRD budget history summarized at [igure 2.2. While the number 
of MOS included in our analysis represents only 6.4% of the total MOS in the 
inventory, during the 5 year pcriod of our study, these 24 MOS accounted for over 
34% of the zone A reenlistments and over 60% of the total zone A bonus budgct 
outlays. With these facts in mind, we will pursue our development of a zone A 
retention model using only the 24 high density MOS. In doing so, we make the 
following observations: 

| Тре developed model should be accurate for the 24 high density MOS. 

2 Inasmuch as the model will account, for over 34% of the total zone А 
reenlistments in the Army, it 1s very likely to be reasonably accurate for tne 
moderate densitv MOS in thc inventorv. (A moderately dense MOS 15 опе for 
which at least 17, but less than 68, outcomes pon quarter can be observed [ог по 
fewer than 14 of the 20 quarters of data available for our study. The requirement 
for 17 observations allows us 9075 confidence that our, estimator is within 20% 
of the true retention rate, Y.) An application of the developed model to those 
МОЗ5 will not be unjustified. 

3 [t mav not be possible to adequatelv represent the retention behavior of all low 
density MOS with, an. overall model. By. their nature, they arc managed 
exceptionally. Their group . 26:10) ог 02010 aliect their 
reenlistment decision will not likely be similar to that of anv other MOS БІЗШЕ 
Efforts to group these low density MOS, creating artificial high. density sample 


cells, as has been done in several studies by both CAA and Rand Corporation 
(including those previously referenced), must be well documented and controlled. 
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TABLE 1 
| 


МОЗ [NC HIGH IN THIS ANALYSIS 


J] 
ITY) 


HIGH DENS 


MOS ТИЙН 


ИВ Infantryman 

ИС Indirect Fire Infantryman 

HH Heavy Anti-armor Weapon Infantryman 
12B Combat Engineer 

12C Bridge Crewmai 

13B Cannon Crewmember 

I3E Cannon Fire Direction Control Specialist 
13F Fire Support Specialist 

16R ADA Short Range Gunnery Crew Member 
165 MANPADS Crewmember 

190 Cavalry Scout 

19Е M48-M60 Armor Crewmember 

31M Multichannel Commo Equip Operator 
31V Tactical Commo Equip Operator 

518 Carpentry / Masonry Specialist 

SAE NBC Specialist 

63B Light Wheel Vehicle Mechanic 

63H Track Vehicle Repairer 

63N M60A1/A3 Tank System Mechanic 

63T Bradley FVS Mechanic 

63W Wheel Vehicle Repairer 

720 Telecommunications Center Operator 


76W Petroleum Supply Specialist 


82C Field Artillery Surveyor 
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lt is acknowledged here that our approach to the sample size problem is very 
conservative. We will show in Chapter 1V, that actual results from applying our 
proposed lincar model to available data for high density MOS, can yield 90% 


confidence intervals which are considerably shorter than (+/-)10%. 


E. DATA PREPARATION 

The zone A SRB level in effect for each MOS and for each quarter is included in 
the candidate carrier variable data set (as variable SRB) without modification. An 
additional variable, SRBSQ (S RB?) 15 also included to account for the possible 
nonlinear effects of the SRB program on retention. 

The FORTRAN code which was used to develop retention rates (response 
variable REUP) and other rates associated with the endogenous variable set is included 
at Appendix A. The retention rate algorithm is straightforward and consistent with the 
rules set forth in section B of this Chapter. The endogenous carrier variables are 
defined for cach of the 24 MOS and for each of the 20 quarters as follows: 

| AFQT : eligible population scoring less than 50 on the AFQT / total eligible, 


2 CIVED : eligible population completing at least 12 years of formal education / 
total eligible, 


3 SEX : eligible females / total eligible, 

4 DEP: eligible population with more than 2 dependents / total eligibles, 

5 RACE : eligible non-caucasians / total eligible. 
Initial demographic rate definitions were suggested by retention program managers at 
MILPERCEN. The final definitions reported above were developed through a trial 
and error process. These definitions were found to provide the most meaningful 
description of an eligible population. 

A variable named REAL was constructed as a linear combination of the СРІ апа 
the annual рау raise received by the service. member. Specifically, 
Fae = Yapayraise - CP. The variable was considered as a carrier because we 
found that it adequately accounted for the changes in the soldier's purchasing power, 
while consuming one fewer model degrees of freedom. 

The E5 and E6 promotion opportunity variables included in the candidate carrier 
variable set were constructed as follows: 

| ESTEST2 : mean time in service (TIS) at promotion to grade E5 for those 


promoted in the previous 12 months (MOS level) / mean TIS at promotion to 
grade ES for those promoted in the previous 12 months (Army level} 


2 ЕбТЕЅТ2 : mean TIS at promotion to grade E6 for those promoted in the 
previous 12 months (MOS level) / mean TIS at promotion to grade LO for 
those promoted in the previous 12 months (Army level). 
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We expect to find that ES and E6 promotion opportunity (here, measured relative to 
an Army average) are effective retention incentives. That is, as the relative opportunity 
for promotion in a particular MOS is enhanced, so should the retention rate be 
enhanced, given the levels of all other factors are unchanged. | 

The seasonally adjusted unemployment rate (UNEMPLY) is included in the 
candidate variable set without modification. 

Our earlicst analysis of the data provided by MILPERCEN indicates the 
existance of a strong seasonal trend in retention. Figure 3.1 graphically depicts this 
trend. The solid line represents the aggregate estimated retention rate for all MOS 
which were not included in the SRB program during our period of analysis. The 
broken line represents the aggregate estimated retention rate for all MOS which were 
included in the SRB program during our period of analysis. 

Three observations can immediately be made. First, the aggregate trends are 
very similar. Second, despite the inducement of a bonus, MOS included in the SRB 
program tend to have lower rates of retention than those not included. Third, and 
most importantly, it 1s evident that we could capture a good deal of the seasonality by 
including the variables QTR (representing the actual fiscal quarter associated with cach 
data point and taking on values 1, 2, 3 or 4) and QTRSQ ( QTR? ) in the candidate 
variable data set. A variable or set of variables which accurately accounts for an effect 
such as seasonality is preferred to an explicit representation of the cause when, such as 


in our case, the result is a large reduction in model degrees of freedom. 


Е. THESTEPWISEREGRESSIORIMIODEL 

Stepwise regression 1s a method of building a multiple linear regression model 
using only the best independent carrier variables. In stepwise regression, we first 
construct a first order linear regression model using onlv that independent variable 
which is most highly correlated with the designated response variable. We check the 
results of an overall F-test to determine if our regression is significant at some 
pre-selected level. If not, we discontinue our analysis and select Q - Y as our best 
predictor. Otherwise, we retain that initial variable in our model and search for a 
second significant carrier variable to enter the regression. The partial correlations of 
each of the remaining candidate carrier variables with the response variable are 
examined and the variable with the highest partial correlation is added to the 


regression. The partial F-statistics of each carrier variable included in the model are 


26 


4 
‘ 
' 
‘ 
' 
' 
' 
' 
1 
П 
' 
' 
' 
' 
1 
1 
! 
' 
' 
3 
' 
4 
| 
Li 
в 
! 
і 
| 
4 
4 


--“ 
„= 
- 
QS a ww ew weet we on ow oe one === 


- 
- 


== == «== = ш «в һе из» ав ах Cone 


12 
FISCAL QUARTER 





Figure 3.1 The Seasonality of Retention 
(Bonus and Non-bonus MOS) 
examined and compared to a pre-selected acceptance level. [f they are both significant, 
they are retained and a third candidate carrier ts proposed. Otherwise, we eliminate the 
non-significant carrier(s) from the regression model and identify the next best 
candidate. This process is continued until the set of variables included in the model 
cannot be altered at the pre-selected significance level. |Ке!. 4: pp. 306, 312]. 

The correlation matrix of the response variable and each of the candidate carrier 
variables for all data in our data set (24 MOS x 20 observations per MOS = 480 
observations) is at Appendix B. Note that the variable SRB is more highly correlated 
with the response variable REUP than any other. There do not appear to be anv 


dangerous correlations among the candidate carriers at the aggregate level. Recall, we 


wish to guard against any singularity or near singularity of the X’X matrix. 


An example of an input data set for MOS 63B is at Appendix C. Note that 
variables SRBSQ and QTRSQ do not appear, as they are constructed in the modelling 
process. An example of the output from a SAS STEPWISE procedure is at Appendix 
D. Precise instructions for interpreting this output are contained in [Ref. 6] and 
(Ref. 7: pp. 761-774]. The SAS commands which were used to generate this output are 
included in Appendix С. 


G. RESULTS OFTEE SIEPWISE AP ADB X 5 

We summarize the results of our stepwise analysis in three ways. First, we 
examine the results of each regression to determine which carrier variables had 
estimated regression cocíficients which were reasonably and consistently signed and 
significant at the .15 acceptance level most often. Then, as a measure of the total 
variation in retention rate explained by our model, we examine the R statistic for all 
MOS included in our analysis. Finally as a measure of goodness of fit, we exanine 
Niallows Ср statistic for all MOS included in our analysis. After we have proposed 
and applied an ovcrall model, a more detailed analysis of model residuals is presented 
in Chapter IV. 

|. Significant Carriers 

In Table 2, each candidate carrier variable is listed. The pair SRB* / SRBSQ* 
and the pair QTR* / QTRSQ* are also included and will be used to record the event 
that doth carriers were considered significant for a particular MOS. For example, if 
SRB and SRBSQ are both included for soie MOS, an observation will not be recorded 
for the carriers SRB and SRBSQ. Instead an observation will be recorded for both 
УКВ” and SRBSQ*. Observations for SRB and SRBSQ (or QTR and QTRSQ) are 
recorded only when they are un-paired. An observation for anv candidate variable is 
recorded when the variable has been included included in the stepwise model at the .15 
level of significance. The manner of record chosen ( + / - ) indicates the sign of the 
estimated coefficient. 

We note in Table 2 that the SRB* / SRBSQ* pair is not often significant while 
the QTR* / QTRSQ* pair is. However, we also note that the variables SRB or 
SRBSQ, or their pair, are considered significant in 17 of the 24 individual models 
examined. Other variables which appear to be excellent carrier candidates are RACE, 
DEP and REAL. 


28 


I [Po 
0. РГО РШ) 
АХСЕ ) 





LEVEL 
CARRIER RESULTS 
SRB + + + + + + + + + + + 
SRBSQ + + + + + 
SRB* n | 
SRBSQ* = 
QTR - 
QTRSQ - - - 
mp + + + + - + + + + 
ШЕКТЕР т ного» ооо се 
КАСЕ + + + + + + + + 


DEP 
EDUCATE 
AFQT 
EEST? 


EGTEST2 
UNEMPLY 
REAL 





2. The R° Statistic 

A commonly accepted statistic for measuring the value of a regression 
equation is the R statistic. The R statistic actually measures the proportion of total 
variation about the mean, Y, which is accounted for by the regression. We are 
cautious in using this statistic, because it can be made arbitrarily high. by adding 
different, albeit meaningless carriers [Ref. 4: p. 33]. 

With this caution in mind, the results of our RŽ analysis are sunimarized in 
Figure 3.2. The horizontal axis is grouped into R2 bins of width 0.1, while the vertical 


axis represents the number of occurances. 
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3. The Mallows C р Statistic 

Another popular statistic for measuring the goodness of fit for a proposed 
model is the Ср statistic developed by С. L. Mallows [Ref. 4; pp. 299, 303). [Ще 
expected value of the statistic 1s approximately the number of independent carriers 
included in the regression model plus the intercept term (p). Extraordinarily high 
values of the Ср statistic indicate that our model suffers considerably from lack of fit; 
that is, our residuals are composed of both random and systematic components. In 
our analysis of the given data, we find that three of the proposed regression models 
obtained via the stepwise procedure suffer from lack of fit. They are the models 
associated with the MOS listed in Table 3. We will pay particular.attention to these 


MOS in attempting to fit an overall model. 


H. THE PROPOSED OYERALCMØDEL 


The proposed overall model, based on the requirements of the study and the 


previous analysis, is as follows: 
, - r n 
Y = ротор А s (3.6) 


+ РАХ з mL ae BoXs + € 


where: 
Y = retention rate (as previously defined) 
УКР 
огсон 
сг В 
Dep РЕГ 
= | 
€ = error component with assumed distribution N( 0, с? ) 


and D is a vector of the parameters to be estimated. 
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Figure 3.2 Distribution of RŽ Values 
(Stepwise Procedure) 


TABEE 3 


ñ LACK OF TIT MODET: TR 
(FROM THE STEPWISE PROCEDURE) 
MOS Ср Statistic 
ПОВ 27725 
31М 36.76 
51В 35 224 
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Pett ДОМЕ А„КЕГЕМШОМ МОРЕ 


In this Chapter, ordinary Icast squares multiple linear regression analysis is used 
to fit the overall model proposed in Chapter III to the data available for the high 
density MOS. The results of this analysis are discussed in terms of carrier significance 
and the RÊ statistic. An examination of the residuals is performed to investigate 
suspected model inadequacies. The model is then fit to data available for the moderate 
density MOS. The results of this analysis are bricfly summarized and potential data 
transformations are discussed. A demonstration of the uses of this model in both a 
preaictive and comparative mode is presented. Finally, alternatives for modelling low 


density MOS are suggested. 


ool OVERALL MODEL FITTED TO HIGH DENSITY MOS 


The overall model, as proposed in the previous Chapter, 1s as follows: 
42-17: 42 10 (4.1) 


+ Ваха 9 fsX4 + ВХ + = 


where: 
Y = retention rate (as previously defined) 
Зон SIND 
га-г 
До Баса 
да = БЕП 
Цэ REAL 
= = error component with assumed distribution N( 0. с? ) 


and D is a vector of the parameters to be estimated. 

In applying ordinary least squares linear regression analysis to our data, we 
recognize that we have 20 unadjusted degrees of freedom (df) available for each. MOS 
(via our 20 quarterly observations on the response and carrier variables). Our 
proposed model requires | df for the intercept estimate, bp, and 6 df for the proposed 


carrier variables, leaving 13 df for error. While no hard and fast rules exist for the 


optimal distribution of available df in the development of a linear model, a good rule is 
to keep the model degrees of freedom (in our case, 7) small relative to the total 
available degrees of freedom. This is a particularly good rule when the modcl degrees 
of freedom are limited, as they are in our analysis. 

The proposed overall model was fitted to the data available for the 24 high 
density MOS. The SAS commands which were used to generate our output are 
included at Appendix G. A copy of our output for example MOS 63B 1s at Appendix 
E 

We can easily summarize our results of this analysis in a manner similar to that 
used for our stepwise analysis in the previous Chapter. First, we cxamine the estimated 
cocfficients of each carrier for each MOS to determine which were most often 
consistent and most often significant. We note that our results for the included carriers 
may well differ from the results we obtained for those same carriers in our stepwise 
procedure. Despite our efforts to select candidate carriers which were unrelated, it is 
very possible that for a particular MOS, a carrier which was included (AFQT, for 
example) in the stepwise model served as a proxy [Ref. 3: p. 317] for some carrier which 
was not included (say, DEP). Since DEP 1s included in the overall model, and AFQT 
is not, it would not be surprising if DEP were to suddenly become significant at the .15 
level in our current analysis, even though it was rejected at that same level in our 
Stepwise analysis. This phenomenon is a consequence of our resolve to develop an 
overall model. 

After our estimated coefficient analysis, we will present an R2 statistic summary, 
similar to that presented in Chapter III. 

1. Significant Carriers 

In Table 4, a summary of the results in terms of significant carriers using 
ordinary least squares multiple regression analysis is presented. The saine definitions 
for QTR* and QTRSQ* apply as in Chapter IIl; that is, they represent paired 
observations on the variables QTR and QTRSQ. We notice that our results from this 
analysis are very similar to those summarized at Table 2 for the stepwise analysis for 
all variables except REAL. Previously, REAL was significant at the .15 acceptance 
level a total of 10 times. In our current analysis, it is significant [8 times, or as many 
times as the variable SRB is significant. 

At the individual MOS level, we can compare our output for MOS 63B via the 


stepwise procedure (Appendix D) to the output generated when the overall model was 


ТАВІ 4 
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fitted (Appendix E). We note that the carriers which were considered significant via 
the stepwise analysis, and which were also included in the overall model, remain 
significant. Carrier variables DEP and REAL, which were not considered significant 
via the stepwise procedure, are also not significant at the .15 level m our current 
analysis, although their estimated regression coefficients are. signed as expected. The 
gencral effect of using an overall model, vice an MOS specific model, in this case is not 
great. The В.2 statistic has been reduced from .93 to .$7, and the overall significance 
level of the regression has been slightly increased, owing to a slightly larger error mean 
square value. 

Note that a critical point made carlicr in this thesis is supported by our 
current analysis. The estimate of an individual regression coefficient is dependent, in 
varying degrees, on it's costock. The estimate b,, with costock including E5TEST?2 
ande m РЕ (ма the stepwise procedure) is valued at .255. With ESTEST2 and 
VINE ely removed, and with DEP and REAL included, the estimate by is Increased 
to .304. While this difference may seem slight (and és with respect to the standard 
error of the estimate), it could be a very significant difference if this coefficient 1s used 


as a point estimate of the effectiveness factor (as discussed earlier). 
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2. The R Statistic 

In Figure 4.1, we note that our lowest observed R? value is in the .65 bin, 
whereas in our stepwise summary at Figure 3.2, it was in the .45 bin (an improvement 
in the distribution of the R“ values). We note also that the number of observations in 
mene 27) bin las been гей са from 4 in Figure 3.2 то 1 in Figure 4.1. 9 We have 
examined a case in the previous section wherein Ше R* value moved from the .95 bin 
to the .85 bin (MOS 63D). MOS 16R is an example of an MOS which moved from the 
.45 bin to the .65 bin in our analysis. 

The actual difference in RÊ values for MOS I6R is .60 - 48 = .12. In the 
stepwise procedure, only DEP and QTRSQ were included as significant carriers (at the 
.15 level of acceptance). When the overall model was fitted to the data available for 
MOS 16R, the other 4 carrier variables were not significant at the .15 acceptance level, 
but all were signed as we expect, and some variables, such as SRB, were significant at 
only slightly higher levels. In all, while the R^ statistic was increased for this MOS. 
and the sum of squares due to regression was increased, the overall significance of the 


regression was slightly reduced by the inclusion of the non-significant carricr terms. 


B. EXAMINATION OF RESIDUALS 

Our residual analysis associated with fitting the proposed overall model to the 
data available for the 24 high density MOS is summarized in the 4 graphics below. 
The residuals of the 24 MOS were examined independently during the analvsis phase of 
this study, but are here presented in an aggregate manner with enhanced effect. 

In conducting a residual analysis, we are examining the validity of the. model 
assumptions concerning the observed errors; that 15, that they are independent, have a 
Q mean, have a constant variance, and follow a normal distribution. At the conclusion 
of our analysis, we should observe that cither our model assumptions appear to be 
violated or they do not appear so. [Ref. 4: pp. 141-142]. 

1, The Frequency Plot 

In Figure 4.2, we present a horizontal bar chart of the residuals, from 
--3to + .3 in bins of width OL The distribution of these residuals should appear 
symmetric (specifically, bell shaped), and centered on 0. No contradiction to our 


normality assumption 1s evident here. 


EN 


2. The Plot against Fitted Values 
In Figure 4.3, we present a plot of the residuals verses the fitted values 
associated with them. We hope to find no regular pattern in the residuals; that 1s, if 
our model assumptions are correct, the distribution of the residuals 15 independent of 
the fitted values. No contradiction to this assumption 18 evident. 
3. The Plot against Time Sequence 
As in the plot against the fitted values, we should observe no patterns of 
significance in the plot of residuals verses sequence of observation. In Figure 4.4, while 
we note a tendency for positive valued residuals associated with observations 6 and 9, 
they are not abnormally low or high and no regular patterns are discernable. 
4. The Serial Correlation Plots 
In Figures 4.5 and 4.6, we test for Lag-1 and Lag-4 serial correlation 
respectively. Jf our observed errors are pairwise uncorrelated, then a cloud centered on 
coordinate (0, 0) should be the only discernable pattern. The Lag-4 plot is suggested 
by our suspicion that some seasonality effects remain, even after the addition of the 
ОТВ and QTRSQ variables to our overall model. It is seen that our suspicions aie 
unfounded. 
With these results in hand, we are prepared to accept our modelling 
assumptions as reasonable. The SAS conunands which were used to produce all the 


previous residual graphics are included at Appendix G. 


С. THE OVERALL MODEL FITIED TOSXIODERATT DPMS! iy S 

We now have an opportunity to verify our proposed overall model with a fresh 
data set. From among the remaining MOS, we selected 50 moderate density MOS for 
which we have record of an active SRB history during the fiscal years 1981-1985. Data 
for these MOS were gathered in the same manner as for the 24 high density MOS. 
The proposed linear inodei was fitted to these data and the results from the 50 
independent fittings are summarized, in the aggregate, as follows: 

1. Significant Carriers 

The primary carrier vanable of interest, SRB, continues to serve as an 

excellent predictor variable. In our current analysis, it is significant at the .15 
acceptance level in 27 of the 50 moderate density models. The pair QTR and QTRSQ 
were also included as significant in 27 of 50 cases. The carriers RACE, REAL and 


DEP were not considered to be as significant as often (14, 14 and И times 
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Figure 4.4 Residuals vs. Time Sequence 
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Figure 4.5 Residual Lag-1 Serial Correlation 
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Figure 4.6 Residual Lag-4 Serial Correlation 
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respectively), but their estimated coefficients were consistently signed (always positive) 


and were frequently significant at levels just above the .15 acceptance threshold. 
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Figure 4.7 Distribution of R2 Values 
(Regressicn Procedure - Moderate Density MOS) 
2. The R? Statistic 
At Figure 4.7, the distribution. of р.2 values, obtained from fitting the 
proposed overall model to the data available for the 50 moderate density MOS, 1s 
plotted, as previously, with a bar chart. Two points are worth noting with respect to 
Figure 4.7. First, as measured in terms of the R^ statistic, our proposed overall model 
continues to serve us well in explaining the variation in retention rate through time at 


the MOS level. Second, the distribution of observations on the R^ statistic for 


44 


modcrate density MOS seems to be more highly spread than the R? distribution for 
high density MOS. This phenomenon is not unexpected when the smaller sample sizes 
associated with the moderate density MOS are considered. If our proposed overall 
model is correct, the decreased level of precision with which we can measure outcomes 
on the response variable, Y, will cause a general increase in the variability of the R? 
Statistic, and a general decrease in it’s mean value. 

Our error term € in the overall model actually accounts for the simultaneous 
effect of errors from several sources. The first, and most obvious source, is our 
inability to know or measure all factors which are critical to the reenlistment decision 
for all soldiers. А second significant source is our inability to measure the true 
response variable, Y. Recall, we estimate the zone A retention rate of a particular 


MOS for a particular quarter with: 


A 
Y = number of soldiers reenlisting for thcir own MOS / number of soldiers eligible 


Гоар 50. 


We have shown that the variance of the estimate generally increases with decreasing 
sample size. IIowcver for a particular MOS, if the general size of the sample can bc 
considered stable in our period of studv, then this measurement error is simply 
absorbed in the error term 6, without effect on the modelling assumptions. To the 
extent that the RÊ statistic can be thought of as the ratio of the variation in the data 
around Y explained by the regression, to total variation in the data around Y (which 
includes the variation accounted for by the error terim), the decrease in the mean R2 
outcome, and increase in variability, are expected for the lower density MOS. [Ref. 8: 
pp. 93-94]. 
3. Residual Analysis 

An extensive analysis of aggregate residual plots 1s not presented here because 
the results are very similar to the results we obtained when the overall model was fitted 
to the data for high density MOS. ‘Опе plot which is worthy of note however, is the 
plot of residuals vs. sequence of observation at Figure 4.8. In our earlier analysis of 
residuals for high density MOS, we noted that residuals for quarters 6 and 9 appeared 
to be skewed positive. We note that for residuals associated with fitting the overall 
model to data for the 50 moderate density MOS, this perceived. skewing 15 ПОЇ 
apparent. This observation lessens our concern that our error term contains systematic 


and biasing components. 
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Figure 4.8 Residuals vs. Time Sequence 
(Moderate Density MOS) 


D. DATA TRANSFORMATIONS 
It is standard practice in regression analysis to consider variance stabilizing 
transformations, such as the arcsin transformation, when the response variable is a 


parameter cstimated by proportional data [Ref. 4: pp. 236-240]. Such a transformation 
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is considered because proportional type data typically do not have a uniform variance; 
the estimated variance of the data is dependent on the rate itself. However, these 
transformations are not used when the value of the estimated rates are in the range 
(0.3 - 0.7 ). In this range, the most common variance stabilizing transformations are 
nearly linear, and the dependence of the sample variance on the estimated rate is 
minimal. In the graph at Figure 4.3, we see no evidence which warrants a variance 
stabilizing transformation of our data. The overall model without transformation is 


believed best suited to the needs of our intended user. 


E. A DEMONSTRATION OF MODEL USE 
We have shown in Chapter III that, given our model is correct, a prediction of Y 


at Xo Is given by: 

о (4.2) 
with variance given by: 

О Gy CPG A (4.3) 


b. ; 2 E 
Using the error mean square term as our best estimate оГ 67, we can construct a 90% 


confidence interval for the true mean value of Y at XQ as follows: 
T. GHI 1.771 Mec ole n 4.4 
са! ХО) (4.4) 


where s represents the square root of error mean square. 

To demonstrate the use of this model, we have arbitrarily selected MOS 1111 (ог 
the purpose of conducting sensitivity analysis. The value of the R^ statistic when the 
overall model was fitted is .7283, and the variables QTRSQ, SRB, and REAL are 
significant at the .15 level of acceptance. Analysis of the residuals reveals no 
significant departure from normality. 

To perform our analysis, we again resort to the SAS statistical software package. 
Тһе ( ХХ у! matrix, the estimated regression coefficients and the error mean square, 
calculated using PROC REG, were printed to an output file. The computational 


formulas shown in equations 4.2, 4.3, and 4.4 were added to this file, and it was 
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prepared as an input file to the SAS PROC MATRIX routine. Copies of the input 
files and output files involved in this procedure are at Appendix F. 

The 7 dimensional vector of values on the independent variables at which we 
wish to predict an outcome for the dependent variable Y is represented by Xo. Let us 
hypothesize an Хо value of (1, 2, 4, 0, 0.4, 0.25, 3.0), where the first position of the 
vector is reserved for the unity multiplier of the intercept term and the remaining 
values represent outcomes on the independent variables QTR, QTRSQ, SRB, RACE, 
DEP, and REAL respectively. The 90% confidence interval on the true mean value of 
Ү аг Хү are shown on the first line at Table 4. The 90% confidence intervals on the 
true mean value of Y at Xo when the hypothesized value of SRD is changed to levels 1, 


2, and 3 are shown on lines 2, 3, and 4 of Table 4 respectively. 


TABLE 5 
SENSITIVITY ANALYYSI TORMI O THE 


SRB Level ов Үр . 90 UB s.e.(Predict) 
0 -301 . 426 вої . 0707 
1 2276 .475 . 574 290561 
2 . 433 . 524 . 615 ‚ ОБТ 
3 . 469 2510 голд . 0585 


In the results summarized at Table 4, we observe two phenomena. lirst, and as 
expected, the value of б increases at a steady rate of .049 with cach unit increase in 
SRB level (.049 is the value of bj). Second, and more importantly however, note the 
behavior of the standard error of the prediction ( s.e. (Predict) - the square root of our 
“(%) term). It decreases through SRB level 2 and increases thereafter. This behavior 
is the result of our moving closer to the center of the sample data space. As we move 
further from the center of the sample data space, reliance on a point estimate for the 
response variable is increasingly dangerous. If we attempt to extrapolate beyond our 
sampie data space, we can have very little confidence in the validity of our point 


prediction [Ref. 4: p. 8]. 


We note also that the widths of the 90% confidence intervals defined above can 
be approximately represented as Va (С-/-) 10%. When MOS for which the overall 
model provided a better fit were considered (such as MOS 63D), these confidence 
intervals were more nearly approximated by us ЕЗУ 

It is not a simple matter to measure 6 dimensional data spaces. For our uses 
however, it is a simple enough matter to ensure that any sensitivity analysis conducted 
with respect to any particular independent variable, or combination of independent 
variables, remains in the range of values defined by the sample data space for those 
variables. In general, when the the independent variables are unrelated, the bulk of the 
potential problems associated with prediction-are avoided if the sensitivity analysis is 
conducted within the individual value ranges of the independent variables. 

We must be particularly careful when the estimated coefficient of any carrier 
variable, such as SRB, is interpreted as the effect of varving the level of the associated 
variable while the other values are unchanged. Even when that variable is unrelated to 
it’s costock, the range of values for which such an interpretation is valid, as described 
by the sample data space, should be respected. This is best shown by example. 

At Appendix E, we have examined the model parameter estimates for MOS 635. 
We note that the coefficient for carrier variable SRB is estimated as .304. This 
estimate 1s based on a sample data range of (0, 1) for the variable SRB. Clearly, it is 
NOemeasOlable tO Use this estimate as an eliectiveness cocllicient at SB levels 2, 3 or 
Јо ет то а 0075 9090, or higher increase in retention rate over the SRB level 0 
rate). Alternatives for prediction and comparison when we wish to extrapolate beyond 


our data space are described in the next section. 


ПО ОО Є ГТ ГМО STRATEGIES 

In developing the overall model, we considered a data base representing 24 high 
density MOS, which were authorized as of 30 September 1985, and for which an active 
SRB history existed during our period of analysis. We then fit the proposed overall 
model to 50 inoderate density MOS with active SRB histories to verify our modelling 
assumptions. Based on our preceding analysis, we propose that the overall model be 
extended for general use in explaining the variation in zone A retention behavior for all 
MOS. We acknowledge however, that as the density associated with an MOS 
decreases, so does our ability to maintain small confidence intervals about our 


parameter and prediction estimates. As stated earlier, this is a consequence of 
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including the additional imprecision associated with our measurement of Y in the crror 
term £. If, in our examination of residuals however, we find no reason to discount our 
modelling assumptions. and no intuitive rcason exists to discount these assumptions, 
then there is no reason to belicve a better model exists. 

In the event that the model suffers grossly from lack of fit, or other factors exist 
which cast doubt on the applicability of the model to a particular MOS, use of this 
model in a predictive procedure for that MOS is not advised. This situation is most 
likely to occur in fitting the model to data associated with very low density, highly 
technical MOS. In such a case, it is advisable to construct and maintain an MOS 
specific predictive model. Any inter-MOS comparison of the estimated coefficients of 
like carrier variables should not include these unique specialues. 

Suggestions for using the developed overall model under extraordinary 
circumstances follow. 

1. Modelling anew MOS 

Typically, when a new MOS is introduced, personnel are reclassified from 
some other specialty, which is in turn reduced in size or eliminated. A pscudo-historic 
data base for the new MOS can then be created by tncluding the records of the 
individual reenlistment decisions and SRB histories applicable to soldiers in the losing 
MOS. 

2. Modelling a Low Density MOS 

When the sample sizes involved in a very low density MOS are so small that 
acceptablely reliable estimates of the regression paramcters cannot be attained, but tlie 
model is believed adequate, then it is recommended that the estimated cocflicients of a 
like MOS, for which an adequate sample size 1s available, be used in retention rate 
prediction. This alternative is suggested in preference to grouping these low density 
MOS for two reasons. First, an explicit decision is made by the SRB program 
manager, as to which MOS can best represent the MOS of concern in retention rate 
projection, With the group method, we averacemiie sellectsmol Severqie ИШЕ 
intuitive that our results with a single most similar MOS should be better. Second, we 
need not develop imaginative ways to group MOS unique factors, such as SRB level, 
across many MOS. 

5. Extrapolating Beyond the Sample Data Space. 

If the extrapolation is not too distant from the sample data space and docs 


not involve extrapolating the SRB level, then it is recommended that we use the 
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developed model without modification, making clear our concern over the increasing 
danger of using a point prediction. If the extrapolation does involve the SRB variable, 
or the extrapolation is far beyond the data space described by our available data in any 
dimension, then selection of a like MOS, with a data space accommodating our needs, 


is recommended for use in analysis. 


y 


V. CONCLUSIONS AND RECOMMENDATIONS 


In this thesis, the problem of developing a predictive model which explains the 
variation in zone A enlisted retention rates at the MOS level is formulated and solved 
using stepwise and ordinary least squares linear regression analysis. Inasmuch as the 
principle use of this model will be in the management of the SRB program, SRB level 
was initially included as a candidate carrier variable. Two other categories of candidate 
carrier variables were also included. The endogenous variables represent a demographic 
profile of an eligible reenlistment population. The exogenous variables represent the 
alternate career opportunities as perceived by the reenlistment decision-maker. This 
approach represents a significant improvement over earlier efforts to solve thus 
problem, in that a capability to include a demographic profile of the eligible 
populations was not previously available to the analyst. 

= To allow for the inter- MOS comparison of the estimated regression coefficients 
associated with the SRB variable, an overall projection model, applicable to all MOS, 
was developed. We selected 24 high density MOS, which had active SRB histories in 
our sample period, to include in our initial analysis. Stepwise multiple linear regression 
analysis was used to find a best overall explanatory model, which could be used to 


project retention at the MOS level. The proposed overall model follows: 


where: 
Y = retention rate 
дү ОВ Черен 
Xə = fiscal quarter 
Ха т rate representing the race profile of an eligible population 
Хд = rate representing the dependent profile of an eligible population 
А5 = rate representing the real change in a soldier’s pay through time 
Є = error component with assumed distribution N( 0, с? ) 


and D is a vector of the parameters to be estimated. 
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We note that the X4 variable is included to account for the effects of the 
observed seasonal behavior in the retention rate. We note also that no variable is 
included in the proposed overall model which accounts for the effects of promotion 
program management. 

Personnel inventory managers at MILPERCEN view the Army promotion 
program as a force alignment tool in the same way that accession and reclassification 
programs are viewed. Promotion opportunity to grades ES and E6 are managed at the 
MOS level with the intention of providing incentives (or disincentives) for zone A 
soldiers to reenlist for their entry MOS. In not including an independent variable in 
our proposed overall model to account for this mechanism, we make no conclusions as 
to it’s effectiveness, but we do conclude that the statistic provided us to measure it's 
eflect 1s inadequate for that purpose. The measure preferred by the MILPERCEN 
program managers, promotion cut-off score, was unavailable during the period of our 
analvsis. We recommend that an analysis similar to the one described in this report, 
including the promotion cut-off scores, be performed when a sufficient base of historic 
records are available. 

We selected 50 moderate density MOS, which had active SRB histories in our 
sample period, to include іп our validation analysis. Our results from this analysis were 
very favorable. We recommend the proposed model for use іп predicting retention for 
all MIOS with the following caveats: 


] Care must be taken in extrapolating beyond the region defined by our sample 
space. 


2 Reliance on point estimates for retention become increasingly dangerous as the 
density of the MOS decreases. 


3 When the estimated regression coefficient of the SRB. variable is, interpreted as 
the effect of varving the SRB level while the level of all other, factors remains 
unchanged, the range of values for which that interpretation is valid must. be 
Ке Реге 
When the regression coefficients cannot be reliably estimated from the available 

data, we recommend the use of the estimated regression coefficients of a like and more 
reliably modelled MOS. We prefer this alternative to the method of creating MOS 
groups, as has been done in past studies, for two reasons. First, a decision 1s explicitly 
made by the SRB program manager, as to which particular MOS model can best 
represent the MOS of concern. Second, the problems associated with grouping MOS 


unique data, such as SRD level, are avoided. 
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The actual estimated regression coefficients developed for each MOS in our 
analysis have not been included in this report. Instead, this analysis has been 
conducted using only those programming languages and analytic software available to 
the DCSPLANS, MILPERCEN, Force Plans Branch. All program code required to 
implement the analytic processes described in this report are included as appendices 
and referenced as appropriate. 

It is recommended that the regression coefficients be estimated on a periodic 
basis using the programs and procedures described in Chapter IV of this report. If it 
becomes apparent that the overall model is no longer adequate, either through 
examination of the residuals or because some measureable factor not included in the 
overall model has become critical to the reenlistment decision ( as could occur with a 
change to the EPMS ), then we recommend that a zone A retention model be newly 


developed following the procedures set forth in Chapter I!I of this study. 


APPENDIX A 
FORTRAN PROGRAM TO PRODUCE DEMOGRAPHIC RATES 
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КЕЦР 
SRB 
RACE 
DEP 

SEX 
EDUCATE 
AFQT 

Е | "712 
ВЕ Го 
QTR 
UNEMPLY 
REAL 


REUP 
SRB 
RACE 
DEP 

SEX 
EDUCATE 
AFQT 
ESTEST2 
Е6ТЕ572 
QTR 
UNEMPLY 
REAL 


Oo O C о к 


REUP 


. 00000 
. 40106 
. 34487 
. 24321 
„да 
.16155 
„1975 


0.01142 


.08781 
.32714 


0.17660 
0.25425 


EST ESI 


. 01142 
239358 
„21260 
. 06718 
. 27534 
‚ 12471 
‚ 16466 
. 00000 
129986 
. 00004 
. 00011 
. 00010 


CORRELATION MATRIX 


SRB 


. 40106 
. 00000 
#0719 
807.3606 
. 19467 
290109 
21111239 
259950 
2596135 
"0157 
. 21680 
210198 


БІЗШШЕ5 І2 


303781 
. 30615 
4292116 
202697 
851507 
. 11160 
213838 
32096 
. 00000 
. 00005 
. 00007 
. 00008 


APPENDIN B 


RACE 


. 34487 
SIDES 
. 00000 
. 10544 
. 52904 
200509 
296051 
. 21260 
2 110 
10:10 55 
12921] 
‚ 14387 


ОТК 


‚32714 
· 06157 
-01095 
. 29468 
. 02192 
. 21847 
.00070 
. 00004 
. 00005 
. 00000 
02818 
. 00000 
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ОЕР 


0. 24321 
0.07366 


‚ 10544 
. 00000 
БЕЛІК 
‚01708 
‚ 01983 
‚ 06718 
.12697 
. 29468 
.09619 
.10925 


UNEMPLY 


. 17660 
. 21680 
12027 
09619 
‚07544 
ОЛИ 
517133 
1011 
.00007 
20255 
. 00000 
20222) 


SEX EDUCATE 


„6337 
. 19467 
. 52904 
. 19403 
. 00000 
225955 
. 00782 
‚27534 
11 
192122 
. 07544 
. 05044 


REAL 


225425 
. 10155 
. 14387 
«10925 
.05044 
. 24003 
. 06349 
. 00010 
. 00008 
. 00000 


0672229 
1. 00000 


16155 
“00109 
"00502 
. 01708 
25550 
. 00000 
293314 
.12471 
.11160 
. 21847 
281271 
. 24003 


AFQT 


10 
"о 
55951 


0. 01983 
0. 00782 


‚33314 
. 00000 
.15466 
19996 
. 00070 
12744 
‚06349 
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СОГАЦОСО ОУ СУЦОКО СЈСОСО КЕ иде ом осоол 
WW WOU OUOMMOWWWMRIO Mm On LD 


eec ont Ще е. пер оро ее с 
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REAL 


UNEMPLY 
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АРРЕМРІХ р 
SAMPLE OUTPUT FILE - SAS PROC STEPWISE 


MOS=63B 


STEPWISE REGRESSION PROCEDURE FOR DEPENDENT VARIABLE REUP 
Q. 40205418 


ТЕРІ VARIABLE EDUCATE ENTERED 


R SQUARE = 


PROB>F 
0. 0027 


РКОВ> I 


0.0027 


PROB>F 
0. 0012 


РКОВ>Ғ 
0. 0007 


РКОВ>Е 


__С(Р) = 45. 63109799 
DF SUM OF SQUARES MEAN SQUARE F 
REGRESSION 1 0. 09995765 0. 09995765 122111 
ERROR 18 0. 14865970 00825887 
TOTAL 19 0. 24861735 
B VALUE STD ERROR TYPE II SS F 
INTERCEPT 1. 52788549 
EDUCATE -1.30962992 0.37644450 0. 09995765 12119 
BOUNDS ON CONDITION NUMBER: 1 2 
STEP 2 VARIABLE REAL ENTERED R SQUARE =_0.54491298 
С(Р) = 32.90644519 
ОЕ SUM OF SQUARES MEAN SQUARE F 
REGRESSION 2 0. 13547482 0. 06773741 10.18 
ERROR 17 0. 11314253 0. 00665544 
TOTAL 19 0. 24861735 
B VALUE STD ERROR TYPE II SS F 
INTERCEPT 1. 54759367 
EDUCATE -1. 36639281 0. 33882394 0. 10823804 16. 26 
REAL 0.01207975 0. 00522910 0. 03551718 5.34 
BOUNDS ON CONDITION NUMBER: 1. 005287, 8. 042296 
STEP 3 VARIABLE QTRSQ ENTERED R SQUARE = 0. 64745038 
C(P) = 24. 33777424 
DF SUM OF SQUARES MEAN SQUARE F 
REGRESSION 3 0. 16096740 0. 05365580 9.79 
ЕВВОВ 16 0. 08764995 0. 00547812 
TOTAL 19 0. 24861735 
B VALUE STD ERROR TYPE II SS F 
INTERCEPT 1. 27877948 
EDUCATE -0. 98469643 35468525 04222311 Z 71 
QIRQ -0. 00725385 0. 00336262 0. 02549258 4. 65 
EAL 0. 01165255 00474824 03299198 6. 02 
BOUNDS ON CONDITION NUMBER: 1. 338361 22. 06034 
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BOUNDS ON CONDITION NUMBER: 


PROB>F 
PROB>F 


Е 
Е 


MEAN SQUARE 
ТҮРЕ-10 00 


SUM OF SQUARES 
STO ERROR 


VARIABLE QTR ENTERED 
DF 


ШЕР 7 


.- е е ә . 


OLO HO r 
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PROB>F 
PROE>F 
PROS?F 
РКОВ> Е 
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824. 5504 
592. 6314 


MEAN SQUARE 
MEAN SQUARE 


ТҮРЕ 5 
ТҮРЕ 5 


39.11734, 
52571779 


510 ЕККОК 
STD ERROR 


SUM OF SQUARES 
SUM OF SQUARES 


B VALUE 
B VALUE 


DF 


VARIABLE EDUCATE REMOVED 
DF 


VARIABLE ESTEST2 ENTERED 


BOUNDS ON CONDITION NUMBER: 
STERZO 


BOUNDS ON CONDITION NUMBER: 


STERT 


СГС с о 


• е е е е 


өл ө. ò ФІ 9 


е о е + + ° 


758. 1488 


36. 40594, 
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BOUNDS ON CONDITION NUMBER: 


PROB>F 
PROB>F 
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DF 


ЭТРЕР 10 


CJUO cto HN 
CO —<f =<! 


Се чоо ОС 


• e е е е е е 


ОВ 


r 
— 
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К VARIABLES MET THE 0.1500 SIGNIFICANCE LEVEL FOR ENTRY 


- 
— 


NC OTHE 


STEL IBNTER 


SUMMARY OF STEPWISE R 


BOUNDS ON CCND 


чў ОО СМО ОГК. 4&0 0 
Ао COQJP COCh c1 
сосъсосчсчелгс-косасо 
Оооо С} 10040 0n 0 
LONI HMO MMO H 
Со ССС) | 


Сом О СУсО СПСО СОСО<+Ь 
Се ГМО ОГУ СЈО 
ОҒ Ом ООО < 
СРДО ДО ДОО Г~СОСО ОпОЗ 


— > (ето (сто (ат ста (ана ева, 


обгон ОГ 


(кюю Д «Эв (жю и 


cACOU CO SF CO) «SFUO SF UOSO 


Lt] 

t— 

«T 

— Fe 

<< = 

ti) о 

се ШІ 
tt] N> 
I— H 
ЕС су «ла 
Сэгс! 113: 
«oco mo i 
OW ct PZ 


сасмсэкх За Ог- сСООЭСО 
rt 


г ~ ПОСОРАСЈСОГ~ С 
CQUCOULOOn DAO cir r- st 
Coco SE QJAOCOQAO cir 
C D D Sct 


PROB>F 


EGGE 


F 


COL O—I CN O Јен со 
MOO MAA AHA ASML 
СОСООМОСОСОСЈУУ Се 
ес О-О ч СУ 


СОО СМСмэГ- CJ СМ 
rH rot 


REMOVED 
REAL 
EDUCATE 


АСОС) Та ОГОО ОЗО 
= 


РКОВ> Е 
0.0001 


27092 
UPPER957 
MEAN 


Ч 


F VALUE 
о/ 


95 
EA 


LOWE 


APPENDIX E 


SAMPLE OUTPUT FILE - SAS PROC REG 
ANALYSIS OF VARIANCE 


PARAMETER ESTIMATES 


DF 


SOURCE 


DEP VARIABLE: REUP 
OBS 


VARIABLE ODF 
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APPENDIX T 
SAMPLE INPUT / OUTPUT FILES - SAS PROC MATRIX 
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INPUT 


X CX MG eX Ye x x x x X X K x X X X xxx 


С014 


Х'ХІМУ 


0.114193 


ROW] 


-0. 0222765 


ROW2 


0. 00705693 


ROW3 


0210918 


ROW4 


-0. 407743 


ROWS 


-0.585478 


ROW6 


0. 00624465 


ROW7 


COLI 


ХО 


-_ СМЕО <ги со 


COL4 


пи 


0.049136/ 


COLI 
Q. 00610136 


EMS 


КОМІ 


201! 
ПР) 


TCRIT 


КОМІ 
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ҮО 
КОМІ 


VAR(YO) 
ROW] 


CI( LOW) 
КОМ 


CI(HIGH) 
КОМІ 
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YO 
ROW] 


VAR( YO) 
ROWI 


CIC(LOW) 
ROW] 


CI(HIGH) 
ROW] 
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ҮО 
ROW] 


VAR(YO) 
КОМІ 


CI(LOW) 
ROW] 


O JI l > SRE LEVER = 0 


COL] 
0.425821 


COL1 
0. 0050044 


COLI 
0. 300537 


COL1 
о о 5 


ШИЕ 27 SRB CEVEL= 1] 


COLI 
0. 474958 


oe) 
0. 00314623 


COLI 
(257502 


С011 
07574295 


OUTPUTS = SSRDSESVEL 2-2 


CORI 
0. 524094 


COLI 
0.00262035 


COL] 
0. 433438 
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* o Xe ove oe ove oe veo x x eo ove xe ove oe ode e * x < 


* o Xe eo oe e eoe de oe ooo е ++ 


CI(HIGH) COE 


ROW1 0.614751 
y oco ve ole oe * x í í í í í w * * * QUTPUT 4 ох SRB LEVEL = 3 o o oe ve se oe eoe ve x x eoe x x x e ve ye e 
YO 011 
ROW1 0.573231 
VAR(YO) 011 
ROW1 0. 00342676 
CI(LOW) COLI 
ROW 0. 469559 
CI(HIGH) COLI 
ROW] 0. 676903 
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STUDY 
ATE AFOT TERM ESTESTI EGTESTI 
SEQ YEAR; 
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APPENDIX G 
EXTRACT OF SAS V5 PROGRAMMING COMMANDS USED IN THIS 


(include data arrays) 
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DATA |АСА; 
MERGE 00141 00142: 
PROC PLGT 
DATA-LAGI; 
PLOT ТЭ 12-139 / МАЕЕ=0 НАЕЕ=0; 
PROC PLOT 
DATA=LAG4 
PLOT RAl*R42-! *! / VREF=0 HREF=0; 
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A Multiple linear regression model for p 
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